rank | frequency | n-gram |
---|---|---|
1 | 1838 | -ة |
2 | 818 | -ت |
3 | 808 | -ا |
4 | 568 | -ن |
5 | 560 | -م |
rank | frequency | n-gram |
---|---|---|
1 | 627 | -ات |
2 | 586 | -ية |
3 | 377 | -ها |
4 | 244 | -ين |
5 | 165 | -رة |
rank | frequency | n-gram |
---|---|---|
1 | 98 | -يات |
2 | 94 | -رات |
3 | 84 | -تها |
4 | 61 | -رية |
5 | 56 | -نية |
rank | frequency | n-gram |
---|---|---|
1 | 38 | -ارات |
2 | 26 | -اتها |
3 | 21 | -انية |
4 | 19 | -اعات |
5 | 19 | -ليات |
rank | frequency | n-gram |
---|---|---|
1 | 10 | -ستخدم |
2 | 8 | -ليمية |
3 | 8 | -إدارة |
4 | 8 | -لومات |
5 | 7 | -تخدام |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings